As the song “Happy” by Pharrell Williams goes, “clap along if you know what happiness is to you”. People could be happy for different reasons. However, do people realize what made them happy today? To figure this out, I investigated the happy moment of people who have different genders, marital status on different categories. A research about happiness analysis follows.
First step for the project is to preprocessing data, which would be able to use for further analysis. (The process was done in the other rmd file named “Text_Processing” and the output was saved in the output file, which could be used directly.)
After looking into data, I found that there are some observations do not make any sense (for example, there are 51 people who have age 233.0) for study use. Thus, I removed those data.
After preprocessing data, what makes people happy? I started the research with overall data exploration on categories of happiness. Thus, I generate a wordcloud which gives the most popular words that make people feel happy.
From the keywords, “friend”, “family” and some words describe entertainment such as “watched” showed a higher popularity. They seem to belong different categories which could make people happy. Thus, I chose to seperate data by category, which would show the main reason that makes people happy. I drew a pie chart and a frequency histogram, which showed the proportion and the number of each category.
According to the results, we got more HappyDB data from category achievement and affection much more than other category, and there is no huge disparity between the two categories (achievement and affection). So in further study, we can regard them as half-to-half roughly, thus our analysis can go on without big deviation from sampling, the idea is the same for gender and marital status.
By studying each category grouped by gender, we can see that compared with other categories, nature and affection seem to be the categories that does not make great difference between males and females because the proportion in these two categories are almost equally divided. Note that the total number of females is much less than the total number of males in the dataset hm_data, which means that in these two categories, it would be more likely that females will be happy because of affection or nature, or we can say that female put more emphasis on their emotions. On the contrary, among the group of exercise or even the group of achievement, males have a much larger proportion, which means that people who are happy because of exercise or achievement, male could be much easier to find happiness. That is similar to the stereotype of the traditional feeling of females and males.
To further study the difference between males and females, we plot the word frequency to show the difference between males and females on achievement and affection.
As we can see from the word frequency plot, achievement for males and females shows some differences. For example, males would show higher happiness from “basketball”, “bike” or “golf”, which belong to competition, while females are easier to be happy as a result of their appearance such as “dress”, “clothes” and “bag”.
For affection, both groups show a very high concentration on family or relationship, such as words “family”, “wife”, “husband”.
Besides gender, marriage might also be another important factor which could influence the reason that makes people feel happy.
As we see from the wordcloud, the most popular words from single group are entertainment like “played”, “watched” and “game”. However, after marriage, people would pay more attention on their family using more words like “son”, “daughter”, “home” and etc.
To figure out more details about the key words for two groups, I studied the correlation between words, where I tried to use network analysis.
As the results show, both groups (single and married) have a center “friend”. However, after people get married, “friend” seems to be not the only center. Words connected to family such as “son”, “daughter” also become centers. That shows that the hapiness of married people is getting complicated. In other words, the center of happiness for single is “friend” while married people could also be more willing to have happy memories with their family.
To compare the two biggest categories (achievement and affection) which make people happy, I tried to compare their word usage.
From the bar plot above we can get that the main difference between achievement and affection for single. Some words will be used much more in affection, like “uncle”, “mom”, “sister”, which is connected to family, while “bonus” and “feedback” would be higher used in achievement part.
For married people, family is still an important topic for affection part, even though achievement still has a large weight on career, words like “mturk” (which is a crowdsourcing Internet marketplace) became the most popular, which are more likely to help people work at home.
To sum up, there are several observations of the study:
First, among 7 categories, it looks like people’s happiness concentrates more on themes of achievement and affection.
For different genders, each category shows a divergence. For example, females are more likely to be happy because of affection including their emotions on family (husband) or relationship (boyfriend) and their appearance (clothes, dress). On the other hand, it might be more easier to be happy for males in their competition (basketball, won) or career (performance).
As for different marital status, married people seem to focus more on their family. Friendship is always a central topic, but after marriage, people also start to use more family connected words (“son”, “daughter”). And even for the categories of achievement and affection, we can see a slight moving trend from career to family in one’s achievement words that make people happy.